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Abstract 



We propose a perspective on knowledge compilation which calls for analyzing different com- 
pilation approaches according to two key dimensions: the succinctness of the target compilation 
language, and the class of queries and transformations that the language supports in polytime. 
We then provide a knowledge compilation map, which analyzes a large number of existing tar- 
get compilation languages according to their succinctness and their polytime transformations and 
queries. We argue that such analysis is necessary for placing new compilation approaches within 
the context of existing ones. We also go beyond classical, flat target compilation languages based 
on CNF and DNF, and consider a richer, nested class based on directed acyclic graphs (such as 
OBDDs), which we show to include a relatively large number of target compilation languages. 

1. Introduction 

Knowledge compilation has emerged recently as a key direction of research for dealing with the 
computational intractability of general propositional reasoning (Darwiche, 1999; Cadoli & Donini, 
1997; Boufkhad, Gregoire, Marquis, Mazure, & Sais, 1997; Khardon & Roth, 1997; Selman & 
Kautz, 1996; Schrag, 1996; Marquis, 1995; del Val, 1994; Dechter & Rish, 1994; Reiter & de 
Kleer, 1987). According to this direction, a propositional theory is compiled off-line into a target 
language, which is then used on-line to answer a large number of queries in polytime. The key 
motivation behind knowledge compilation is to push as much of the computational overhead into 
the off-line phase, which is amortized over all on-line queries. But knowledge compilation can serve 
other important purposes as well. For example, target compilation languages and their associated 
algorithms can be very simple, allowing one to develop on-line reasoning systems for simple software 
and hardware platforms. Moreover, the simplicity of algorithms that operate on compiled languages 
help in streamlining the effort of algorithmic design into a single task: that of generating the smallest 
compiled representations possible, as that turns out to be the main computational bottleneck in 
compilation approaches. 

There are three key aspects of any knowledge compilation approach: the succinctness of the 
target language into which the propositional theory is compiled; the class of queries that can be 
answered in polytime based on the compiled representation; and the class of transformations that 
can be applied to the representation in polytime. The Al literature has thus far focused mostly on 
target compilation languages which are variations on DNF and CNF formulas, such as Horn theories 
and prime implicates. Moreover, it has focused mostly on clausal entailment queries, with very little 
discussion of tractable transformations on compiled theories. 

The goal of this paper is to provide a broad perspective on knowledge compilation by considering 
a relatively large number of target compilation languages and analyzing them according to their 
succinctness and the class of queries/transformations that they admit in polytime. 
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Instead of focusing on classical, flat target compilation languages based on CNF and DNF, we 
consider a richer, nested class based on representing propositional sentences using directed acyclic 
graphs, which we refer to as NNF. We identify a number of target compilation languages that have 
been presented in the AI, formal verification, and computer science literature and show that they 
are special cases of NNF. For each such class, we list the extra conditions that need to be imposed 
on NNF to obtain the specific class, and then identify the set of queries and transformations that the 
class supports in pol}d;ime. We also provide cross-rankings of the different subsets of NNF, according 
to their succinctness and the polytime operations they support. 

The main contribution of this paper is then a map for deciding the target compilation language 
that is most suitable for a particular application. Specifically, we propose that one starts by iden- 
tifying the set of queries and transformations needed for their given application, and then choosing 
the most succinct language that supports these operations in polytime. 

This paper is structured as follows. We start by formally defining the NNF language in Section 2, 
where we list a number of conditions on NNF that give rise to a variety of target compilation languages. 
We then study the succinctness of these languages in Section 3 and provide a cross-ranking that 
compares them according to this measure. We consider a number of queries and their applications in 
Section 4 and compare the different target compilation languages according to their tractability with 
respect to these queries. Section 5 is then dedicated to a class of transformations, their applications, 
and their tractability with rc!spc;ct to the different target compilation languages. We finally close in 
Section 6 by some concluding remarks. Proofs of all theorems are delegated to Appendix A. 

2. The NNF Language 

We consider more than a dozen languages in this paper, all of which are subsets of the NNF language, 
which is defined formally as follows (Darwiche, 1999, 2001a). 

Definition 2.1 Let PS be a denumerable set of propositional variables. A sentence in WFps is 

a rooted, directed acyclic graph (DAG) where each leaf node is labeled with true, false, X or -^X , 
X G PS; and each internal node is labeled with A or V and can have arbitrarily many children. 
The size of a sentence S in NNFpg, denoted \T, \, is the number of its DAG edges. Its height is the 
maximum number of edges from the root to some leaf in the DAG. 

Figure 1 depicts a sentence in NNF, which represents the odd parity function (we omit reference 
to variables PS when no confusion is anticipated). Any propositional sentence can be represented 
as a sentence in NNF, so the NNF language is complete. 

It is important here to distinguish between a representation language and a target compilation 
language. A representation language is one which we expect humans to read and write with some 
ease. The language of CNF is a popular representation language, and so is the language of Horn 
clauses (especially when expressed in rules form). On other hand, a target compilation language does 
not need to be suitable for human specification and interpretation, but should be tractable enough 
to permit a non-trivial number of polytime queries and/or transformations. We will consider a 
number of target compilation languages that do not qualify as representation languages from this 
perspective, as they are not suitable for humans to construct or interpret. We will also consider a 
number of representation languages that do not qualify as target compilation languages.^ 

A formal characterization of representation languages is outside the scope of this paper. But for 
a language to qiialify as a target compilation language, we will require that it permits a polytime 
clausal entailment test. Note that a polytime consistency test is not sufficient here, as only one 
consistency test on a given theory does not justify its compilation. Given this definition, NNF does 

1. It appears that when proposing target compilation languages in the AI literature, there is usually an implicit 
requirement that the proposed language is also a representation language. As we shall see later, however, the 
most powerful target compilation languages are not suitable for humans to specify or interpret directly. 



230 



A Knowledge Compilation Map 



Decomposability 




-A B -iB A C -iD D -,C 



Determinism 




-,AB-iBA C-,DD-,C 



Smootliness 




nAB-iBA C-iDD-iC 



Figure 1: A sentence in NNF. Its size is 30 and height is 4. 



not qualify as a target compilation language unless P=NP (Papadimitriou, 1994), but many of its 
subsets do. Wc define a number of these subsets below, each of which is obtained by imposing 
further conditions on NNF. 

We will distinguish between two key subsets of NNF: flat and nested subsets. We first consider 
flat subsets, which result from imposing combinations of the following properties: 

• Flatness: The height of each sentence is at most 2. The sentence in Figure 3 is flat, but the 

one in Figure 1 is not. 

• Simple-disjunction: The children of each or-node are leaves that share no variables (the 
node is a clause). 

• Simple-conjunction: The children of each and-node are leaves that share no variables (the 
node is a term). The sentence in Figure 3 satisfies this property. 

Definition 2.2 The language f-NNF is the subset of NNF satisfying flatness. The language CNF is 
the subset o/f-NNF satisfying simple-disjunction. The language DNF is the subset o/f-NNF satisfying 
simple-conjunction. 

CNF does not permit a polytime clausal entailment test (unless P=NP) and, hence, does not qualify 
as a target compilation language. But its dual DNF does. 

The following subset of CNF, prime implicates, has been quite influential in computer science: 

Definition 2.3 The language PI is the subset of CNF in which each clause entailed by the sentence 
is subsumed by a clause that appears in the sentence; and no clause in the sentence is subsumed by 

another. 

A dual of PI, prime impUcants IP, can also be defined. 

Definition 2.4 The language IP is the subset of DNF in which each term entailing the sentence 
subsumes some term that appears in the sentence; and no term in the sentence is subsumed by 
another term. 

There has been some work on representing the set of prime implicates of a propositional theory in 

a compact way, allowing an exponential number of prime implicates to be represented in polynomial 
space in certain cases — see for example the TRIE representation in (de Kleer, 1992), the ZBDD 
representation used in (Simon & del Val, 2001), and the implicit representation based on meta- 
products, as proposed in (Madre & Coudert, 1992). These representations are different from the 
language PI in the sense that they do not necessarily support the same queries and transformations 
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that we report in Tables 5 and 7. They also exhibit different succinctness relationships than the 
ones we report in Table 3. 

Horn theories (and renamable Horn theories) represent another target compilation subset of CNF, 
but we do not consider it here since we restrict our attention to complete languages L only, i.e., we 
require that every prepositional sentence is logically equivalent to an element of L. 

We now consider nested subsets of the NNF language, which do not impose any restriction on 
the height of a sentence. Instead, these subsets result from imposing one or more of the following 
conditions: decomposability, determinism, smoothness, decision, and ordering. We start by defining 
the first three properties. Prom here on, if C is a node in an NNF, then Vars{C) denotes the set of 
all variables that label the descendants of node C. Moreover, if S is an NNF sentence rooted at C, 
then Vars{T,) is defined as Vars{C). 

• Decomposability (Darwiche, 1999, 2001a). An NNF satisfies this property if for each con- 
junction C in the NNF, the conjuncts of C do not share variables. That is, if Ci, . . . , C„ are 
the children of and-nodc C, then Vars(Ci) O Vars{Cj) = for z 7^ j. Consider the and-node 
marked in Figure 1(a). This node has two children, the first contains variables A,B while the 
second contains variables C, D. This and-node is then decomposable since the two children do 
not share variables. Each other and-node in Figure 1(a) is also decomposable and, hence, the 
NNF in this figure is decomposable. 

• Determinism (Darwiche, 2001b): An NNF satisfies this property if for each disjunction C 
in the NNF, each two disjuncts of C are logically contradictory. That is, if Ci , . . . , C„ are 
the children of or-node C, then Ci A Cj |= false for i ^ j- Consider the or- node marked 
in Figure 1(b), which has two children corresponding to sub-sentences -^A A B and -^B A A. 
The conjunction of these two sub-sentences is logically contradictory. The or-node is then 
deterministic and so are the other or-nodes in Figure 1(b). Hence, the NNF in this figure is 
deterministic. 

• Smoothness (Darwiche, 2001b): An NNF satisfies this property if for each disjunction C in the 
NNF, each disjunct of C mentions the same variables. That is, if Ci, . . . , C„ are the children of 
or-node C, then Vars{Ci) = Vars{Cj) for i ^ j. Consider the marked or-node in Figure 1(c). 
This node has two children, each of which mentions variables A,B. This or-node is then 
smooth and so are the other or-nodes in Figure 1(c). Hence, the NNF in this figure is smooth. 

It is hard to ensure decomposability. It is also hard to ensure determinism while preserving decom- 
posability. Yet any sentence in NNF can be smoothed in polytime, while preserving decomposability 
and determinism. Preserving flatness, however, may blow-up the size of given NNF. Hence, smooth- 
ness is not that important from a complexity viewpoint unless we have flatness. 

The properties of decomposability, determinism and smoothness lead to a number of interesting 
subsets of NNF. 

Definition 2.5 The language DNNF is the subset o/NNF satisfying decomposability; d-NNF is the sub- 
set satisfying determinism; s-NNF is the subset satisfying smoothness; d-DNNF is the subset satisfying 
decomposability and determinism; and sd-DNNF is the subset satisfying decomposability, determinism 
and smoothness. 

Note that DNF is a strict subset of DNNF (Darwiche, 1999, 2001a). The following decision property 
comes from the literature on binary decision diagrams (Bryant, 1986). 

Definition 2.6 (Decision) A decision node N in an NNF sentence is one which is labeled with true, 
false, or is an or-node having the form (X A a) V {-^X A /?), where X is a variable, a and (3 are 
decision nodes. In the latter case, dVar{N) denotes the variable X. 

Definition 2.7 The language BDD is the set ofHHF sentences, where the root of each sentence is a 
decision node. 
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Figure 2: On the left, a sentence in the BDD language. On the right, its corresponding binary decision 
diagram. 



The NNF sentence in Figure 2 belongs to the BDD subset. 

The BDD language corresponds to binary decision diagrams (BDDs), as known in the formal 
verification literature (Bryant, 1986). Binary decision diagrams arc depicted using a more compact 
notation though: the labels true and false are denoted by 1 and 0, respectively; and each decision 

or 

and and 

node is denoted by « K The BDD sentence on the left of Figure 2 corresponds to the 

binary decision diagram on the right of Figure 2. Obviously enough, every NNF sentence that satisfies 
the decision property is also deterministic. Therefore, BDD is a subset of d-NNF. 

As we show later, BDD does not qualify as a target compilation language (unless P=NP), but the 
following subset does. 

Definition 2.8 FBDD is the intersection o/DNNF and BDD. 

That is, each sentence in FBDD is decomposable and satisfies the decision property. The FBDD language 
corresponds to free binary decision diagrams (FBDDs), as known in formal verification (Gergov & 
Meinel, 1994a). An FBDD is usually defined as a BDD that satisfies the read-once property: on 
each path from the root to a leaf, a variable can appear at most once. FBDDs are also known as 
read-once branching programs in the theory literature. Imposing the read-once property on a BDD 
is equivalent to imposing the decomposability property on its corresponding BDD sentence. 

A more influential subset of the BDD language is obtained by imposing the ordering property: 

Definition 2.9 (Ordering) Let < be a total ordering on the variables PS. The language OBDD< 
is the subset of FBDD satisfying the following property: if N and M are or-nodes, and if N is an 
ancestor of node M, then dVar{N) < dVar{M). 

Definition 2.10 The language OBDD is the union of all OBDD< languages. 

The OBDD language corresponds to the well-known ordered binary decision diagrams (OBDDs) 
(Bryant, 1986). 

Our final language definition is as follows: 

Definition 2.11 MODS is the subset ofDW where every sentence satisfies determinism and smooth- 
ness. 
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or 



and and and and 




X Y Z ^ -lY -2 



Figure 3: A sentence in language MODS. 



NNF 




Figure 4: The set of DAG-bascd languages considered in this paper. An edge Li L2 means that 
Li is a proper subset of L2. Next to each subset, we list the polytime queries supported 
by the subset but not by any of its ancestors (see Section 4). 



Figure 3 depicts a sentence in MODS. As we show later, MODS is the most tractable NNF subset we 
shall consider (together with OBDD<). This is not surprising since from the syntax of a sentence in 
MODS, one can immediately recover the sentence models. 

The languages we have discussed so far are depicted in Figure 4, where arrows denote set inclusion. 
Table 1 lists the acronyms of all of these languages, together with their descriptions. Table 2 lists 
the key language properties discussed in this section, together with a short description of each. 

3. On the Succinctness of Compiled Theories 

We have discussed more than a dozen subsets of the NNF language. Some of these subsets arc well 
known and have been studied extensively in the computer science literature. Others, such as DNNF 
(Darwiche, 2001a, 1999) and d-DNNF (Darwiche, 2001b), are relatively new. The question now is: 
What subset should one adopt for a particular application? As we argue in this paper, that depends 
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Acronym 


Description 


NNF 


Negation Normal Form 


DNNF 


Decomposable Negation Normal Form 


d-NNF 


Deterministic Negation Normal Form 


s-NNF 


Smooth Negation Normal Form 


f-NNF 


Flat Negation Normal Form 


d-DNNF 


Deterministic Decomposable Negation Normal Form 


sd-DNNF 


Smooth Deterministic Decomposable Negation Normal Form 


BDD 


Binary Decision Diagram 


FBDD 


Free Binary Decision Diagram 


OBDD 


Ordered Binary Decision Diagram 


OBDD< 


Ordered Binary Decision Diagram (using order <) 


DNF 


Disjunctive Normal Form 


CNF 


Conjunctive Normal Form 


PI 


Prime Implicates 


IP 


Prime Implicants 


MODS 


Models 



Table 1: Language acronyms. 



Property 


Short Description 


Flatness 


The height of NNF is at most 2 


Simple Disjunction 


Every disjunction is a clause, where literals share no variables 


Simple Conjunction 


Every conjunction is a term, whore literals share no variables 


Decomposability 


Conjuiiets do not sliar(> variables 


Deteriiiiuisiu 


Di.sjuiicts are logically (lisjoiut 


Smoothness 


Disjuncts mention the same set of variables 


Decision 


A node of the form true, false, or (X A a V -^X A 
where X is a variable and a, (5 are decision nodes 


Ordering 


Decision variables appear in the same order on any path in the NNF 



Table 2: Language properties. 
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on three key properties of the language: its succinctness, the class of tractable queries it supports, 
and the class of tractable transformations it admits. 

Our goal in this and the following sections is to construct a map on which we place different 
subsets of the NNF language according to the above criteria. This map will then serve as a guide to 
system designers in choosing the target compilation language most suitable to their application. It 
also provides an example paradigm for studying and evaluating further target compilation languages. 
We start with a study of succinctness^ in this section (Gogic, Kautz, Papadimitriou, & Selman, 1995). 

Definition 3.1 (Succinctness) Let Li and L2 be two subsets of NNF. Li is at least as succinct 

as L2, denoted Li < L2, iff there exists a polynomial p such that for every sentence a G L2, there 
exists an equivalent sentence /3 G Li where |/3| <p(|a|). Here, \a\ and are the sizes of a and /3, 
respectively. 

Wc stress here that wc do not require that there exists a function that computes [5 given a in 
polytime: we only require that a polysize (3 exists. Yet, our proofs in Appendix A contain specific 
algorithms for computing (3 from a in certain cases. The relation < is clearly reflexive and transitive, 
hence, a prc-ordering. One can also define the relation <, where Li < L2 iff Li < L2 and L2 ^ Li. 

Proposition 3.1 The results in Table 3 hold. 

An occurrence of < in the cell of row r and column c of Table 3 means that the fragment 
given at row r is at least as succinct as the fragment given at column c. An occurrence of ^ (or 
^*) means that is not at least as succinct as (provided that the polynomial hierarchy does 
not collapse in the case of ^*). Finally, the presence of a question mark reflects our ignorance about 
whether is at least as succinct as L^,. Figure 5 summarizes the results of Proposition 3.1 in terms 
of a directed acyclic graph. 

A classical result in knowledge compilation states that it is not possible to compile any proposi- 
tional formula a into a polysize data structure /3 such that: a and j3 entail the same set of clauses, 
and clausal entailment on /3 can be decided in time polynomial in its size, unless NP C P/poly 
(Selman & Kautz, 1996; Cadoli & Donini, 1997). This last assumption implies the collapse of the 
polynomial hierarchy at the second level (Karp & Lipton, 1980), which is considered very unlikely. 
We use this classical result from knowledge compilation in some of our proofs of Proposition 3.1, 
which explains why some of its parts are conditioned on the polynomial hierarchy not collapsing. 

We have excluded the subsets BDD, s-NNF, d-NNF and f-NNF from Table 3 since they do not 
qualify as target compilation languages (see Section 4). We kept NNF and CNF though given their 
importance. Consider Figure 5 which depicts Table 3 graphically. With the exception of NNF and 
CNF, all other languages depicted in Figure 5 qualify as target compilation languages. Moreover, with 
the exception of language PI, DNNF is the most succinct among all target compilation languages — we 
know that PI is not more succinct than DNNF, but we do not know whether DNNF is more succinct 
than PI. 

In between DNNF and MODS, there is a succinctness ordering of target compilation languages: 

DNNF < d-DNNF < FBDD < OBDD < OBDD< < MODS. 

DNNF is obtained by imposing decomposability on NNF; d-DNNF by adding determinism; FBDD by 
adding decision; and OBDD and OBDD< by adding ordering (w.r.t. any total ordering on PS in the 
first case and a specific one in the second case) . Adding each of these properties reduces language 
succinctness (assuming that the polynomial hierarchy does not collapse). 

One important fact to stress here is that adding smoothness to d-DNNF does not affect its suc- 
cinctness: the sd-DNNF and d-DNNF languages are equally succinct. It is also interesting to compare 

2. A more general notion of space efficiency (model preservation for polysize reductions) exists (Cadoli, Donini, 
Liberatore, & Schaerf, 1996), but we do not need its full generality here. 
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Table 3: Succinctness of target compilation languages. * means that the result holds unless the 
polynomial hierarchy collapses. 



sd-DNNF 




OBDD^ 






MODS 



Figure 5: An edge Li L2 indicates that Li is strictly more succinct than L2: Li < L2, while 
Li = L2 indicates that Li and L2 are equally succinct: Li < L2 and L2 < Li. Dotted 
arrows indicate unknown relationships; for instance, the dotted arrow from DNNF to PI 
means that we do not know whether DNNF is at least as succinct as PI. Some of the edges 
are conditioned on the polynomial hierarchy not collapsing — see Table 3. 



sd-DNNF (which is more succinct than the influential FBDD, OBDD and OBDD< languages) with MODS, 
which is a most tractable language. Both sd-DNNF and MODS are smooth, deterministic and de- 
composable. MODS, however, is flat and obtains its decomposability from the stronger condition 
of simple-conjunction. Therefore, sd-DNNF can be viewed as the result of relaxing from MODS the 
flatness and simple-conjunction conditions, while maintaining decomposability, determinism and 
smoothness. Relaxing these conditions moves the language three levels up the succinctness hierar- 
chy, although it compromises only the polytime test for sentential entailment and possibly the one 
for equivalence as we show in Section 4. 
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4. Querying a Compiled Theory 

In evaluating the suitability of a target compilation language to a particular application, the suc- 
cinctness of the language must be balanced against the set of queries and transformations that it 
supports in polytime. We consider in this section a number of queries, each of which returns valu- 
able information about a propositional theory, and then identify target compilation languages which 
provide polytime algorithms for answering such queries. We restrict our attention in this paper to 
the existence of polytime algorithms for answering queries, but we do not present the algorithms 
themselves. The interested reader is referred to (Darwiche, 2001a, 2001b, 1999; Bryant, 1986) for 
some of these algorithms and to the proofs of theorems in Appendix A for others. 

The queries we consider are tests for consistency, validity, implicates (clausal entailment), im- 
plicants, equivalence, and sentential entailment. We also consider counting and enumerating theory 
models; see Table 4. One can also consider computing the probability of a propositional sentence, 
assuming that all variables are probabilistically independent. For the subsets we consider, however, 
this can be done in polytime whenever models can be counted in polytime. 

From here on, L denotes a subset of language NNF. 

Definition 4.1 (CO, VA) L satisfies CO (VA) iff there exists a polytime algorithm that maps 

every formula E from h to 1 if Yi is consistent (valid), and to otherwise. 

One of the main applications of compiling a theory is to enhance the efficiency of answering 
clausal entailment queries: 

Definition 4.2 (CE) L satisfies CE iff there exists a polytime algorithm that maps every formula 
S from L and every clause 7 from NNF to 1 i/ E |= 7 holds, and to otherwise. 

A key application of clausal entailment is in testing equivalence. Specifically, suppose we have a 
design expressed as a set of clauses A'' — /\ - ai and a specification expressed also as a set of clauses 
A" = /\j j3j, and we want to test whether the design and specification are equivalent. By compiling 
each of A** and A* to targets and that support a polytime clausal entailment test, we can test 
the equivalence of A"^ and A** in polytime. That is, A"^ and A* are equivalent iff F'' |= (3j for all j 
and F** |= for all i. 

A number of the target compilation languages we shall consider support a direct polytime equiv- 
alent test: 

Definition 4.3 (EQ, SE) L satisfies EQ fSEj iff there exists a polytime algorithm that maps every 
pair of formulas S, <[> from Ltolz/E = $ (E^$j holds, and to otherwise. 

Note that sentential entailment (SE) is stronger than clausal entailment and equivalence. Therefore, 
if a language L satisfies SE, it also satisfies CE and EQ. 
For completeness, we consider the following dual to CE: 

Definition 4.4 (IM) L satisfies IM iff there exists a polytime algorithm that maps every formula 
S from L and every term 7 from NNF to 1 i/ 7 |= S holds, and to otherwise. 

Finally, we consider counting and enumerating models: 

Definition 4.5 (CT) L satisfi.es CT iff there exists a polytime algorithm that maps every formula 
E from L to a nonnegative integer that represents the number of models ofT, (in binary notation). 

Definition 4.6 (ME) L satisfies ME iff there exists a polynomial p{., .) and an algorithm, that 
outputs all models of an arbitrary formula E from L in time p{n,m), where n is the size ofS and 
m is the number of its models (over variables occurring in E^. 
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Notation 


Query 


CO 


polytime consistency check 


VA 


polytime validity clieck 


CE 


polytime clausal entailment clieck 


IM 


polytime implicant check 


EQ 


polytime equivalence check 


SE 


polytime sentential entailment check 


CT 


polytime model counting 


ME 


polytime model enumeration 



Table 4: Notations for queries. 



L 


CO 


VA 


CE 


IM 


EQ 


SE 


CT 


ME 


NNF 


o 


o 


o 


o 


o 


o 


o 


o 


DNNF 




o 




o 


o 


o 


o 




d-NNF 


o 


o 


o 


o 


o 


o 


o 


o 


s-NNF 


o 


o 


o 


o 


o 


o 


o 


o 


f-NNF 


o 


o 


o 


o 


o 


o 


o 


o 


d-DNNF 




^/ 




^/ 


? 


o 


^/ 




sd-DNNF 




^/ 




^/ 
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o 
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Table 5: Subsets of the NNF language and their corresponding polytime queries. ^ means "satisfies" 
and o means "does not satisfy unless P = NP." 



Table 4 summarizes the queries we are interested in and their acronyms. 

The following proposition states what we know about the availability of polytime algorithms for 
answering the above queries, with respect to all languages we introduced in Section 2. 

Proposition 4.1 The results in Table 5 hold. 

The results of Proposition 4.1 are summarized in Figure 4. One can draw a number of conclusions 
based on the results in this figure. First, NNF, s-NNF, d-NNF, f-NNF, and BDD fall in one equivalence 
class that does not support any polytime queries and CNF satisfies only VA and IM; hence, none 
of them qualifies as a target compilation language in this case. But the remaining languages all 
support polytime tests for consistency and clausal entailment. Therefore, simply imposing either 
of smoothness (s-NNF), determinism (d-NNF), flatness (f-NNF), or decision (BDD) on the NNF lan- 
guage does not lead to tractability with respect to any of the queries we consider — neither of these 
properties seem to be significant in isolation. Decomposability (DNNF), however, is an exception and 
leads immediately to polytime tests for both consistency and clausal entailment, and to a polytime 
algorithm for model enumeration. 
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Recall the succinctness ordering DNNF < d-DNNF < FBDD < OBDD < OBDD< < MODS 
from Figure 5. By adding decomposability (DNNF), we obtain polytime tests for consistency and 
clausal entailment, in addition to a polytime model enumeration algorithm. By adding determinism 
to decomposability (d-DNNF), wc obtain polytime tests for validity, implicant and model counting, 
which are quite significant. It is not clear, however, whether the combination of decomposability and 
determinism leads to a polytime test for equivalence. Moreover, adding the decision property on top 
of decomposability and determinism (FBDD) docs not appear to increase tractability with respect to 
the given queries^, although it does lead to reducing language succinctness as shown in Figure 5. On 
the other hand, adding the ordering property on top of decomposability, determinism and decision, 
leads to polytime tests for equivalence (OBDD and OBDD<) as well as sentential entailment provided 
that the ordering < is fixed (OBDD<). 

As for the succinctness ordering NNF < DNNF < DNF < IP < MODS from Figure 5, note that 
DNNF is obtained by imposing decomposability on NNF, while DNF is obtained by imposing flatness 
and simple-conjunction (which is stronger than decomposability). What is interesting is that DNF is 
less succinct than DNNF, yet does not support any more polytime queries; see Figure 4. However, the 
addition of smoothness (and determinism) on top of flatness and simple-conjunction (MODS) leads to 
five additional polytime queries, including equivalence and entailment tests. ^ 

We close this section by noting that determinism appears to be necessary (but not sufficient) for 
polytime modcil counting: only deterministic languages, d-DNNF, sd-DNNF, FBDD, OBDD, DBDD< and 
MODS, support polytime counting. Moreover, polytime counting implies a polytime test of validity, 
but the opposite is not true. 

5. Transforming a Compiled Theory 

A query is an operation that returns information about a theory without changing it. A transfor- 
mation, on the other hand, is an operation that returns a modific^cl theory, which is then operated 
on using queries. Many applications require a combination of transformations and queries. 

Definition 5.1 (AC,VC) Let L &e a subset o/NNF. L satisfies AC (VC) iff there exists a polytime 
algorithm that maps every finite set of formulas Ei, . . . , E„ from. L to a formula ofL that is logically 
equivalent to Ei A . . . A S„ fEi V ... V E„^. 

Definition 5.2 (-■C) Let h be a subset o/NNF. L satisfies -iC iff there exists a polytime algorithm 
that maps every formula E from L to a formula ofL that is logically equivalent to -lE. 

If a language satisfies one of the above properties, we will say that it is closed under the corre- 
sponding operator. Closure under logical connectives is important for two key reasons. First, it has 
implications on how compilers are constructed for a given target language. For example, if a clause 
can be easily compiled into some language L, then closure under conjunction implies that compiling 
a CNF sentence into L is easy. Second, it has implications on the class of polytime queries supported 
by the target language: If a language L satisfies CO and is closed under negation and conjunction, 
then it must satisfy SE (to test whether A ^ F, all we have to do, by the Refutation Theorem, 
is test whether A A -iF is inconsistent). Similarly, if a language satisfies VA and is closed under 
negation and disjunction, it must satisfy SE by the Deduction Theorem. 

3. Deciding the equivalence of two sentences in FBDD, d-DNNF, or in sd-DNNF, can be easily shown to be in coNP. 

However, wc do not have a proof of coNP-hardness, nor do we have deterministic polytime algorithms for deciding 
these problems. Actually, the latter case is quite unlikely as the equivalence problem for FBDD has been intensively 
studied, with no such algorithm in sight. Note, however, that the equivalence of two sentences in FBDD can be 
decided probabilistically in polytime (Blum, Chandra, & Wegman, 1980), and similarly for sentences in d-DNNF 
(Darwiche & Huang, 2002). 

4. Given flatness, simple-conjunction and smoothness, we can obtain determinism by simply removing duplicated 
terms. 



240 



A Knowledge Compilation Map 



It is important to stress here that some languages arc closed under a logical operator, only if the 
number of operands is bounded by a constant. We will refer to this as bounded closure. 

Definition 5.3 (ABC,VBC) Let L be a subset 0/ NNF. L satisfies ABC (VBCJ iff there exists 
a polytime algorithm that maps every pair of formulas E and $ from Jj to a formula of L that is 
logically equivalent ioEA$ fEW ^). 

We now turn to another important transformation: 

Definition 5.4 (Conditioning) (Darwiche, 1999) Let T, be a propositional formula, and let 7 be 
a consistent term. The conditioning of S on 7, noted T, \ j, is the formula obtained by replacing 
each variable XofEby true (resp. false) if X (resp. -^X) is a positive (resp. negative) literal 0/7. 

Definition 5.5 (CD) Let h be a subset o/NNF. L satisfies CD iff there exists a polytime algorithm 
that m,aps every formula S from L and every consistent term 7 to a formula from L that is logically 

equivalent to S | 7. 

Conditioning has a number of applications, and corresponds to restriction in the literature on 
Boolean functions. The main application of conditioning is due to a theorem, which says that S A 7 
is consistent iff S | 7 is consistent (Darwiche, 2001a, 1999). Therefore, if a language satisfies CO 
and CD, then it must also satisfy CE. Conditioning also plays a key role in building compilers 
that enforce decomposability. If two sentences Ai and A2 are both decomposable (belong to DNNF), 
their conjunction Ai A A2 is not necessarily decomposable since the sentences may share variables. 
Conditioning can be used to ensure decomposability in this case since Ai A A2 is equivalent to 
\/^(Ai I 7) A (A2 I 7) A 7, where 7 is a term covering all variables shared by Ai and A2. Note that 
\/^(Ai I 7) A (A2 I 7) A 7 must be decomposable since Ai | 7 and A2 | 7 do not mention variables 
in 7. The previous proposition is indeed a generalization to multiple variables of the well-known 
Shannon expansion in the literature on Boolean functions. It is also the basis for compiling CNF into 
DNNF (Darwiche, 1999, 2001a). 

Another critical transformation we shall consider is that of forgetting (also referred to as marginal- 
ization, or elimination of middle terms (Boole, 1854)): 

Definition 5.6 (Forgetting) Let S 6e a propositional formula, and let li- be a subset of variables 
from PS. The forgetting o/X from S, denoted 3X.S, is a formula that does not mention any variable 
from X and for every formula a that does not mention any variable from X, we have E |= a precisely 
when 3X.I] \= a. 

Therefore, to forget variables from X is to remove any reference to X from S, while maintaining all 
information that S captures about the complement of X. Note that 3X.S is unique up to logical 
equivalence. 

Definition 5.7 (FO, SFO) Let h be a subset of NNF. L satisfies FO iff there exists a polytime 
algorithm that maps every formula S from L and every subset X of variables from PS to a formula 
from L equivalent to 3X.E. // the property holds for singleton X, we say that L satisfies SFO. 

Forgetting is an important transformation as it allows us to focus/project a theory on a set of 
variables. For example, if we know that some variables X will never appear in entailment queries, 
we can forget these variables from the compiled theory while maintaining its ability to answer 
such queries correctly. Another application of forgetting is in counting/enumerating the instantia- 
tions of some variables Y, which are consistent with a theory A. This query can be answered by 
counting/enumerating the models of 3X.A, where X is the complement of Y. Forgetting also has 
applications to planning, diagnosis and belief revision. For instance, in the SATPLAN framework. 
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Notation 


Transformation 


CD 


polytime conditioning 


FO 


polytime forgetting 


SFO 


polytime singleton forgetting 


AC 


polytime conjunction 


ABC 


polytime bounded conjunction 


vc 


polytime disjunction 


VBC 


polytime bounded disjunction 


^C 


polytime negation 



Table 6: Notations for transformations. 
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Table 7: Subsets of the NNF language and their polytime transformations. ^ means "satisfies," • 
means "does not satisfy," while o means "does not satisfy unless P=NP." 



compiling away fluents or actions amoimts to forgetting variables. In model-based diagnosis, com- 
piling away every variable except the abnormality ones does not remove any piece of information 
required to compute the conflicts and the diagnoses of a system (Darwiche, 2001a). Forgetting has 
also been used to design update operators with valuable properties (Herzig & Rifi, 1999). 

Table 6 summarizes the transformations we are interested in and their acronyms. The following 
proposition states what we know about the tractability of these transformations with respect to the 
identified target compilation languages. 

Proposition 5.1 The results in Table 7 hold. 

One can draw a number of observations regarding Table 7. First, all languages we consider satisfy 
CD and, hence, lend themselves to efficient application of the conditioning transformation. As for 
forgetting multiple variables, only DNNF, DNF, PI and MODS permit that in polytime. It is important 
to stress here that none of FBDD, OBDD and OBDD< permits polytime forgetting of multiple variables. 
This is noticeable since some of the recent applications of OBDD< to planning — within the so-called 
symbolic model checking approach to planning (A. Cimmati & Traverso, 1997) — depend crucially 
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on the operation of forgetting and it may be more suitable to use a language that satisfies FO in 
this case. Note, however, that OBDD and OBDD< allow the forgetting of a single variable in polytime, 
but FBDD does not allow even that. d-DNNF is similar to FBDD as it satisfies neither FO nor SFO. 

It is also interesting to observe that none of the target compilation languages is closed under 
conjunction. A number of them, however, are closed under bounded conjunction, including OBDD<, 
DMF, IP and MODS. 

As for disjunction, the only target compilation languages that are closed under disjunction are 
DNNF and DNF. The OBDD< and PI languages, however, are closed under bounded disjunction. Again, 
the d-DMNF, FBDD and OBDD languages are closed under neither. 

The only target compilation languages that are closed under negation are FBDD, OBDD and OBDD<, 
while it is not known whether d-DNNF or sd-DNNF are closed under this operation. Note that d-DNNF 
and FBDD support the same set of polytime queries (equivalence checking is unknown for both) 
so they are indistinguishable from that viewpoint. Moreover, the only difference between the two 
languages in Table 7 is the closure of FBDD under negation, which does not seem to be that significant 
in light of no closure under either conjunction or disjunction. Note, however, that d-DNNF is more 
succinct than FBDD as given in Figure 5. 

Finally, OBDD< is the only target compilation language that is closed under negation, bounded 
conjunction, and bounded disjunction. This closure actually plays an important role in compiling 
propositional theories into OBDD< and is the basis of state-of-the-art compilers for this purpose 
(Bryant, 1986). 

6. Conclusion 

The main contribution of this paper is a methodology for analyzing propositional compilation ap- 
proaches according to two key dimensions: the succinctness of the target compilation language, and 
the class of queries and transformations it supports in polytime. The second main contribution 
of the paper is a comprehensive analysis, according to the proposed methodology, of more than 
a dozen languages for which we have produced a knowledge compilation map, which cross-ranks 
these languages according to their succinctness, and the polytime queries and transformations they 
support. This map allows system designers to make informed decisions on which target compilation 
language to use: after the class of queries/transformations have been decided based on the applica- 
tion of interest, the designer chooses the most succinct target compilation language that supports 
such operations in polytime. Another key contribution of this paper is the uniform treatment wc 
have applied to diverse target compilation languages, showing how they all are subsets of the NNF 
language. Specifically, we have identified a number of simple, yet meaningful, properties, including 
decomposability, determinism, decision and flatness, and showed how combinations of these proper- 
ties give rise to different target compilation languages. The studicicl subsets include some well known 
languages such as PI, which has been influential in AI; OBDD<, which has been influential in formal 
verification; and CNF and DNF, which have been quite influential in computer science. The subsets 
also include some; relatively new languages such as DNNF and d-DNNF, which appear to represent 
interesting, new balances between language succinctness and query/transformation tractability. 
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Appendix A. Proofs 

To simplify the proofs of our main propositions later on, we have identified a number of lemmas that 
we list below. Some of the proofs of these lemmas are direct, but we include them for completeness. 

Lemma A.l Every sentence in d-DNNF can be translated to an equivalent sentence in sd-DNNF in 
polytime. 

Proof: Let q: = ai V . . . V be an or-node in a d-DNNF sentence E. Suppose that a is not smooth 
and let V = Vars{a). Consider now the sentence obtained by replacing in S each such node 
by Vr=i ^» ^ /\vev\Vars{ai)i~''" ^ '^)- Then Eg is equivalent to S and is smooth. Moreover, Eg can 
be computed in time polynomial in the size of S and it satisfies decomposability and determinism. □ 

Lemma A. 2 Every sentence in FBDD can be translated to an equivalent sentence in FBDD n s-NNF 

in polytime. 

Proof: Let E be a sentence in FBDD and let a be a node in E. We can always replace a with {YAa)\/ 
(-■F A a), for some variable Y, while preserving equivalence and the decision property. Moreover, 
as long as the variable Y does not appear in a and is not an ancestor of a, then decomposability is 
also preserved (that is, the resulting sentence is in FBDD) . Note here that "ancestor" is with respect 
to the binary decision diagram notation of E-see left of Figure 2. 

Now, suppose that {X A a) V (^X A (3) is an or-node in E. Suppose further that the or-node 
is not smooth. Hence, there is some Y which appears in Vars{(3) but not in Vars{a) (or the other 
way around). Since E is decomposable, then Y cannot be an ancestor of a (since in that case it 
would also be an ancestor of /3, which is impossible by decomposability of E). Hence, we can re- 
place a with (F A a) V {^Y A a), while preserving equivalence, decision and decomposability. By 
repeating the above process, we can smooth E while preserving all the necessary properties. Finally, 
note that for every or-node {X A a) V {^X A /?) in E, we need to repeat the above process at most 
I Vars{a) — Vars{P) \ + \ Vars(/3) — Vars{a) \ times. Hence, the smoothing operation can be per- 
formed in polytime. □ 

Lemma A. 3 // a subset L o/NNF satisfies CO and CD, then it also satisfies ME. 

Proof: Let E be a sentence in L. First, we test if E is inconsistent (can be done in polytime). If 
it is, we return the empty set of models. Otherwise, we construct a decision-tree representation of 
the models of E. Given an ordering of the variables xi,. . . ,Xn of Vars(E), we start with a tree 
T consisting of a single root node. For i = 1 to n, we repeat the following for each leaf node a 
(corresponds to a consistent term) in T: 

a. If E I a A is consistent, we add Xi as a child to a; 

h. If E I a A -iXi is consistent, we add -^Xi as a child to a. 

The key points are: 

• Test (a) and Test (b) can be performed in time polynomial in the size of E (since L satisfies 
CO and CD). 
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• Either Test (a) or Test (b) above must succeed (since S is consistent). 

Hence, the number of tests performed is 0{mn), where m is the number of leaf nodes in the final 
decision tree (bounded by the number of models of E) and n is the number of variables of E. □ 

Lemma A. 4 If a subset o/NNF satisfies CO and CD, then it also satisfies CE. 

Proof: To test whether sentence E entails non-valid clause a, E ^ a, it suffices to test whether 
E I -la is inconsistent (Darwiche, 2001a). □ 

Lemma A. 5 Let a and (3 he two sentences that share no variables. Then ay (3 is valid iff a is valid 
or P is valid. 

Proof: a V /? is valid iff -^a A is inconsistent. Since -^a and share no variables, then -la A 
is inconsistent iff ->a is inconsistent or is. This is true iff a is valid or /3 is valid. □ 

Lemma A. 6 Let T, be a sentence in d-DNNF and let 7 &e a clause. Then a sentence in d-DNNF 
which is equivalent to 'EW j can be constructed in polytime in the size of E and 7. 

Proof: Let be the literals that appear in clause 7. Then (3 = Vr=i(^» ^ Aj=i 

equivalent to clause 7, is in d-DNNF, and can be constructed in polytime in size of 7. Now let a be 
the term equivalent to ^7. We have that E V 7 is equivalent to ((E | a) A a) V /3. The last sentence 
is in d-DNNF and can be constructed in polytime in size of E and 7. □ 

Lemma A. 7 // a subset o/NNF satisfies VA and CD, then it also satisfies IM. 

Proof: To test whether a consistent term q entails sentence E, a |= E, it suffices to test whether 
-■aVE is valid. This sentence is equivalent to -iccV (qAE), to -iqV (aA (E | a)), and to -laV (E | a). 
Since and E | a share no variables, the disjunction is valid iff -^a is valid or E | a is valid (by 
Lemma A. 5). cannot be valid since a is consistent. E | a can be constructed in polytime since 
the language satisfies CD and its validity can be tested in polytime since the language satisfies VA. □ 

Lemma A. 8 Every CNF or DNF formula can be translated to an equivalent sentence in BDD in 
polytime. 

Proof: It is straightforward to convert a clause or term into an equivalent sentence in BDD. In order 

to generate a BDD sentence corresponding to the conjimction (resp. disjunction) of BDD sentences a 
and /3, it is sufhcient to replace the 1-sink (resp. 0-sink) of a with the root of /?. □ 

Lemma A. 9 // a subset o/NNF satisfies EQ, then it satisfies CO and VA. 

Proof: true and false belong to every NNF subset. E is inconsistent iff it is equivalent to false. E is 
valid iff it is equivalent to true. □ 

Lemma A. 10 // a subset ofWiF satisfies SE, then it satisfies EQ, CO and VA. 

Proof: Sentences Ei and E2 are equivalent iff Ei \= E2 and E2 |= Ei. EQ implies CO and VA 
(Lemma A.9). □ 
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Lemma A. 11 Let S 6e a sentence in d-DNNF and let 7 6e a clause. The validity ofEV 'j can be 
tested in time polynomial in the size ofT, and 7. 

Proof: Construct S V 7 in polytime as given in Lemma A. 6 and check its validity, which can be 
done in polytime too. □ 

Lemma A. 12 For every propositional formula E and every consistent term 7, we have S|7 is 
equivalent to 
3Vars(7).(I] A7). 

Proof: Without loss of generality, assume that E is given by the disjunctively-interpreted set of 
its models (over Vars(E)). Conditioning S on 7 leads (1) to removing every model of -17, then 
(2) projecting the remaining models so that every variable of 7 is removed. Conjoining E with 7 
leads exactly to (1), while forgetting every variable of 7 in the resulting formula leads exactly to (2) 
(Lang, Liberatore, & Marquis, 2000). □ 

Lemma A. 13 Each sentence E in f-NNF can be converted into an equivalent sentence E* in poly- 
nomial time, where E* e CNF or E* e DMF. 

Proof: We consider three cases for the sentence E: 

1. The root node of E is an and- node. In this case, E can be turned into a CNF sentence E* in 
polynomial time by simply ensuring that each or-node in E is a clause (that is, a disjunction 
of literals that share no variables). Let C be an or-node in E. Since E is flat and its root is 
an and-node, C must be a child of the root of E and the children of C must be leaves. Hence, 
we can easily ensure that C is a clause as follows: 

• If we have one edge from C to some leaf X and another edge from C to -^X (C is valid), 
we replace the edge from the root to C by an edge from the root to true. 

• If we have more than one edge from C to the same leaf node X, we keep only one of these 

edges and delete the rest. 

2. The root of E is an or-nodc. E can be turned into a DNF sentence E* in a dual way.^ 

3. The root of E is a leaf node. E is already a CNF sentence. 

□ 

Lemma A. 14 a is a prime implicant (resp. an essential prime implicant) of sentence E iff is 
a prime implicate (resp. an essential prime implicate) of-'Jl. ^ 

Proof: This is a folklore result, immediate from the definitions. □ 



Proof of Proposition 3.1 

The proof of this proposition is broken down into eight steps. In each step, we prove a number of 
succinctness relationships between different languages, and then apply transitivity of the succinctness 
relation to infer even more relationships. Associated with each step of the proof is a table in which 
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HTfF I DHMF | d-DHMF | FBDD | DBDD | DBDD<; 



PI I IP I HODS I Bd-DHHF | 



NNF 
DNWF 



Table 8: 



I d-DMF I FBDD | DEPP 



DBDD 
OBDD^ 



Table 9: 



we mark all relationships that are proved in that step-we don't show these marks in the very first 
table though. 

Table 8: Follows immediately from the language inclusions reported in Figure 4. 

Table 9: We can prove both that DNF ^ PI and CNF ^ IP (this slightly generalizes the results 
DNF ^ CNF and CNF ^ DNF given in (Gogic et al, 1995)). 

Let us consider the CNF formula S„ = Ar=o^(^2i V X2i+i)- This formula is in prime implicates 
form^ (and each clause in S„ is an essential prime implicate of it). Hence its negation -iS„ is in 
prime implicants form (as an easy consequence of Lemma A. 14). 

Since Quine's early work (Quine, 1959), we know that the number of essential prime implicants 
(rcsp. prime implicates) of a formula is a lower bound of the number of terms (resp. clauses) that 
can be found in any DNF (resp. CNF) representation of it (indeed, any such representation must 
include the essential prime). S„ has 2" essential prime implicants. Indeed, this can be easily shown 
by induction on n given that (i) every literal occurring in S„ occurs only once, (ii) the set of prime 
implicants of any nontautological clause is the set of literals occurring in it (up to logical equivalence), 
and (iii) the distribution property for prime implicants (see e.g., (dual of) Proposition 40 in (Marquis, 
2000)) which states that IP{aA[3) = TOaa;({PlQ API/3 | PIq € JP(q:), PI/3 € IP{P)}, h) (up to logical 
equivalence). Subsequently, has 2" essential prime implicates (cf. Lemma A. 14). Accordingly, 
we obtain that both DNF ^ PI and CNF ^ IP. We also obtain PI ^ IP and IP ^ PI. Now, it is 
well-known that some DNF formulas have exponentially many prime implicants (see the proof of 
Proposition 5.1 where wc show that IP does not satisfy SFO). Hence, their negations arc CNF 

5. Note that f-NNF satisfies and that the negation of a CNF sentence (rcsp. DNF sentence) can be turned into a 
DNF (resp. CNF) in linear time. 

6. A prime implicant (rcsp. a prime imphcate) a of S is essential iff the disjunction (resp. conjunction) of all prime 
implicants (resp. prime implicates) of S except a is not equivalent to S. 

7. The correctness of (the dual of) Quine's consensus algorithm for computing prime implicants (Quine, 1955) 
ensures it, since no clause of S„ is subsumed by another clause and no consensi can be performed since there are 
no negated variables. 
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MHF I DHHF | d-DMHF | FBDD | " 



I DHF I CHF I PI I IP I MPDS | sd-DHNF | 



Table 10: 
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< 
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Table 11: 



formulas having exponentially many prime implicates. Subsequently IP ^ DNF and PI ^ CNF. The 
remaining results in this table follow from the transitivity of <. 

Table 10: The parity function 0„ = ®"^q Xi has linear size OBDD< representations (Bryant, 
1986) but only exponential size CNF and DNF representations. The reason is that 0„ has 2" 
essential prime implicants (resp. essential prime implicates) and the number of essential prime 
implicants (resp. essential prime implicates) of a formula is a lower bound of the size of any of its 
DNF (resp. CNF) representation. This easily shows that both CNF ^ OBDD and DNF ^ OBDD. The 
remaining results in this table follow from the language inclusions reported in Figure 4. 

Table 11: It is shown in (Darwiche, 2001b) that there is a sentence in d-DNNF which only 
has exponential FBDD representations. Accordingly, we have FBDD ^ d-DNNF. In (Gergov & Meinel, 
1994a), it is shown that OBDD ^ FBDD. Finally, it is easy to show that OBDD< ^ OBDD (for instance, 
the formula I]„ = /\"^i{xi <4> tji) has an OBDD< representation of size polynomial in n whenever < 
satisfies xi < yi < X2 < ■ ■ ■ < Xn < y-m while it has an OBDD< representation of size exponential in 
n provided that < is s.t. xi < X2 < ■ ■ ■ < Xn < yi < y2 < ■ ■ ■ < Vn)- The remaining results in this 
table follow from the language inclusions reported in Figure 4. 

Table 12: L' ^* L means that L' ^ L unless the polynomial hierarchy PH collapses. The 
results in this table follow since the existence of polysize knowledge compilation functions for clausal 
entailment implies the collapse of the polynomial hierarchy PH (Selman & Kautz, 1996; Cadoli & 
Donini, 1997). Now, if DNNF < CNF, then for each sentence S in CNF there exists a polysize equivalent 
sentence F in DNNF. Therefore, we can test whether a clause is entailed by S in polytime by testing 
whether the clause is entailed by F. This proves the existence of polysize knowledge compilation 
functions for clausal entailment, leading to the collapse of the polynomial hierarchy PH. The same 
is true for d-DNNF and sd-DNNF since all these languages support a polytime clausal entailment test 
(see Proposition 4.1). 

Table 13: In (Wegener, 1987) (Theorem 6.2 pp. 436), a family of n^-variable boolean fimctions 
S is pointed out. Provided that every interpretation / over these variables represents a n- vertices 
digraph (for every 1 < i, j < n, we have I{xij) = 1 iff is an arc of the digraph), S(J) = 1 iff the 
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L I MHF I DHMF | d-DNMF | FBDD | DBDD | DBDD ^ | DBF | CHF | PI | IP | HODS | sd-DHHF | 
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Table 12: 



L I NNF I DMMF | d-DMtJF | FBDD | OBDD | OBDD^ | DNF | CHF | PI | IP | HODS | sd-DNMF 
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Table 13: 

digraph represented by / contains a fc-cliq\ie of a spceial kind (fc is a parameter of the family). It is 
shown that for certain values of k (depending on n), every FBDD representation of S has exponential 
size. Moreover, it is shown that S has only a cubic number of prime implicants. This shows that 
FBDD ^ IP, hence FBDD ^ DNF. Because FBDD satisfies (sec Proposition 5.1),^ it cannot be the 
case that -iS has a polynomial size FBDD. Since -iS has only a cubic number of prime implicates, we 
obtain that FBDD ^ PI, hence FBDD ^ CNF. The remaining results in this table follow since FBDD< 
OBDD < OBDD<. 

Table 14: Assume that d-DNNF < DNF holds. As a consequence, every sentence S in DNF can be 
compiled into an equivalent d-DNNF sentence S* of polynomial size. Now, checking whether a clause 
7 is entailed by the CNF sentence S is equivalent to checking whether the DNF sentence V 7 is 
valid. Checking whether (-iE)*V7 is valid — when (-iS)* is a d-DNNF sentence and 7 is a clause — can 
be achieved in polynomial time by Lemma A. 11. Therefore, (^E)* is a polysize compilation of the 

8. That is, a sentence in FBDD can be negated in polytime to yield a sentence in FBDD too. 



L I NNF I DHMF | d-DNNF | FBDD | DBDD | DBDD^ | DNF | CNF | PI | IP | MODS | sd-DNNF | 



NNF 
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Table 14: 
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L I HHF I DHTTF | d-DHHF | FBDD | QBDD | QBDD<; | DHF | CHF | PI | IP | HDDS | Bd-DHMF | 
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Table 15: 

CNF sentence S, allowing clausal entailment to be achieved in polynomial time. The existence of 
such (-iS)* for every CNF sentence S implies the collapse of the polynomial hierarchy (Selman & 
Kautz, 1996; Cadoli & Donini, 1997). Hence, we obtain that d-DNNF ^* DNF. As a consequence, we 
also have d-DNNF ^* DNNF. Finally, since every d-DNNF sentence can be turned in polynomial time 
into an equivalent sd-DNNF sentence by Lemma A.l, we have sd-DNNF < d-DNNF. Moreover, since 
d-DNNF < sd-DNNF, we obtain sd-DNNF ^* DNF, sd-DNNF ^* DNNF, sd-DNNF < FBDD, sd-DNNF < 
OBDD, sd-DNNF < OBDD<, FBDD ^ sd-DNNF, OBDD< ^ sd-DNNF, DNF ^ sd-DNNF, CNF ^ sd-DNNF, PI 
^ sd-DNNF and IP ^ sd-DNNF. 

Table 15: Let us now show that MODS is not less succinct than PI, IP, sd-DNNF and OBDD. 
First, let us consider the formula S = ViLi ^ '"^^ represented by PI, IP, sd-DNNF and OBDD 
formulas of size polynomial in n. Contrastingly, E cannot be represented by a MODS formula of 
size polynomial in n since S has 2" — 1 models over Vars(E). Now, it is well-known that the old 
good Quinc-McCluskey's algorithm for generating prime implicants from a MODS representation of 
a propositional formula S runs in time polynomial in the number of models of E (Wegener, 1987). 
This shows that IP < MODS. As to CNF and OBDD<, it is obvious that a decision tree (or Shannon tree) 
for E that respects a given total ordering over Vars{Yi) can be generated in polynomial time from a 
MODS representation of E. Such a decision tree has m 1-leaves where m is the number of models of 
E over Vars{Y,). Accordingly, it has at most n*m 0-leaves where n = \ Vars{Yi)\. Since the set of all 
paths from the root of the tree to any 0-lcaf can be read as a CNF representation of E, we obtain that 
CNF < MODS. On the other hand, since reducing a decision tree to derive a corresponding OBDD< 
can be done in polynomial time, it follows that an OBDD< representation of E can also be generated 
from a MODS representation of it. Hence, OBDD< < MODS. The remaining results in this table follow 
from the language inclusions reported in Figure 4. □ 

Proof of Proposition 4.1 

The proof of this proposition is broken down into twelve steps. In each step, we prove a number 
of results. Associated with each step of the proof is a table in which we mark all results that are 
proved in that step. The table of the last step includes all results declared by this proposition. 

Table 16: Every classical CNF or DNF formula can be translated in a straightforward way into 
an equivalent f-NNF sentence (with a tree structure) in polytime. Moreover, every NNF sentence 
can be translated into an equivalent s-NNF sentence in polytime (Lemma A.l). Given that CO 
is NP-hard (resp. VA is coNP-hard) for classical CNF (resp. DNF) sentences, and the inclusion 
between the various NNF subsets reported in Figure 4, we obtain the table. 

Table 17: SE implies both CO and VA (Lemma A. 10). Moreover, since CT implies both CO 
and VA, IM implies VA (valid term), and CE implies CO (inconsistent clause), we obtain the 
table. 
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L I CO I VA I CE I IM I EQ | CT | SE | ME | 
NWF I o I I o I 

DNNF I o I 

d-WHF 
d-DNNF 
BDD 
FBDD 
□BDD 
DBDD^ 
DNF I o I 

CNF I o I 

MODS 

B-NNF I o I I o I 

f-NNF I o I I o I 

ad-DNNF 



Table 16: 



L I CO I VA I CE I IM I EQ | CT | SE | ME | 

MF o o I o I I oT I o I " I o I 

DNNF o I o I I o I I o I 

d-NNF 
d-DNWF 
BDD 
FBDD 
OBDD 

QBDD^ . - — 

DNF o I o I I o I I o I 

CNF o I o I I o I I o I 

MODS 

s-HNF o o I o I I oT I o I " I o I 

f-NNF o o I o I I o I I o I I o I 

sd-DNNF 



Table 17: 



L I CO I VA I CE I IM I EQ | CT | SE | ME | 
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Table 18: 
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L I CO I VA I CE I IM I EQ | CT | SE | ME | 
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Table 19: 



L I CO I VA I CE I IM I EQ | CT | SE | ME | 
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Table 20: 



Table 18: A sentence E is consistent (resp. valid) iff it has a model (resp. 2" models, where 
n = I Vars{Yi)\). Moreover, the number of models of S is given by the number of edges outgoing 
from the or-node in any MODS representation of E. Accordingly, CO, VA and CT can be achieved 
in polynomial time when S is given by a MODS formula which gives us the table. 

Table 19:Because DNNF satisfies CE (Darwiche, 2001a), CE implies CO and MODS C DNF C 
DNNF, IPC DNF and 

sd-DNNF C d-DNNF C DNNF, we obtain the table. 
Table 20: We now use the following results: 

CD and CO imply CE (Lemma A.4). 

CD and VA imply IM (Lemma A. 7). 

CD and CO imply ME (Lemma A.3). 

All considered NNF subsets satify CD (cf. Proposition 5.1). 

If an NNF subset does not satisfy CO it cannot satisfy ME. 

It is well-known that FBDD satisfies CO, VA and CT, and that OBDD< satisfies (in addition) 
EQ (Gergov & Meinel, 1994a; Bryant, 1992). 

Since E |= a holds iff E A is inconsistent and since OBDD< satisfies CO, -iC and ABC (cf. 
Proposition 5.1), OBDD< also satisfies SE. 
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L I CO I VA I CE I IM I EQ | CT | SE | ME | 



KKF 


o 


o 


o 


o 




o 


o 


o 


DHHF 


V 


o 




o 




o 


o 


■J 


d-NNF 


















d-DNNF 


















BDD 


















FBDD 






%/ 










%/ 


DBDD 


V 


V 




%/ 


•J 




o 




OBDD<; 


V 


V 






•J 




v" 




DNF 


V 


o 


•J 


o 




o 


o 




CNF 


o 


Ivl 


o 


Ivl 




o 


o 


o 


PI 








Ivl 










IP 


✓ 




■J 










n/ 


HDDS 






•J 






■J 




n/ 


s-NNF 


o 


O 


o 


o 




o 


o 


o 


f-NNF 






o 


o 




o 




o 


Sd-DNMF 



















Table 21: 



L I CO I VA I CE I IM I EQ | CT | SE | ME 
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Table 22: 

Obviously enough, any query concerning OBDD is equivalent to the corresponding query con- 
cerning OBDD< provided that only one DAG is brought into play. Together with the above 
results, we conclude that OBDD satisfies CO, VA and CT. Since this fragment satisfies CD as 
well, it satisfies CE, IM and ME in addition. It also satisfies EQ (see Theorem 8.11 from 
(Meinel & Theobald, 1998)) but does not satisfy SE (unless P = NP). Indeed, it is known 
that checking the consistency of two OBDD< formulas a and /? (based on two different vari- 
able orderings <) is NP-complete (Lemma 8.14 from (Meinel & Theobald, 1998)). Since OBDD 
satisfies -iC and since a A /3 is consistent iff a ^ -■/?, checking sentential entailment for OBDD 
formulas is co NP-complete. 

These results lead to the table. 

Table 21: It is known that IM is satisfied by classical CNF formulas (hence. Pi) (in order to 
check whether a non-valid clause is implied by a consistent term, it is sufficient to test that they 
share a literal). CNF (hence, PI) is also known to satisfy VA. Wc then obtain the table. 

Table 22: Every sentence in CNF or DNF can be turned into an equivalent sentence in BDD in 
polytime (Lemma A. 8). Hence, a o in a CNF or DNF cell implies a o in the corresponding BDD cell. 
Similarly, since BDD C d-NNF, a o in a BDD cell implies a o in the corresponding d-NNF cell. This 
leads to the table. 

Table 23: Since EQ implies CO and VA (Lemma A. 9), a o in a CO or VA cell implies a o in 

the corresponding EQ cell. This leads to the table. 

Table 24: By definition, PI satisfies CE and IP satisfies IM. Since PI C CNF and IP C DNF, 
this implies that both PI and IP satisfy SE. Now, SE impUes EQ, hence both PI and IP satisfy EQ 

(actually, two equivalent formiilas share the same prime implicates and the same prime implicants 
(both forms are canonical ones, provided that one representative per equivalence class is considered, 
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•J 
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v/ 
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L I CO I VA I CE I IM I EQ | CT | SE | ME 



NNF 
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DNNF 




o 




o 


o 


o 


o 


■J 
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Table 24: 
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L I CO I VA I CE I IM I EQ I CT | SE | ME | 
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Table 26: 



only)). Since PI satisfies CE, it also satisfies CO. Since it satisfies CD as well (cf. Proposition 5.1), 
it also satisfies ME (Lemma A. 3). Contrastingly, the models counting problem for monotone Krom 
formulas (i.e. conjunctions of claiiscs containing at most two literals and only positive literals) 
is #P-complete (Roth, 1996). Such formulas can easily be turned into prime implicates form in 
polynomial time (Marquis, 2000), hence PI does not satisfy CT. Now, since the negation of a 
formula S in prime implicates form is a formula in prime implicants form (cf. Lemma A. 14), and 
since the number of models of -iS over Vars(S) is 2l^'^'~"(^)l minus the number of models of S over 
V'ars(E), we necessarily have that IP does not satisfy CT. This also imply that IP satisfies VA, 
leading to the tabic. 

Table 25: In the proof of Proposition 3.1, we have shown that the prime implicants of S can 
be computed in polytime from a MODS representation of S. As an immediate consequence, since IP 
satisfies IM, EQ and SE, we obtain that MODS satisfies IM, EQ and SE, leading to the table. 

Table 26: Since d-DNNF satisfies CT (Darwiche, 2001b), it also satisfies VA. Since it satisfies 
CD (Proposition 5.1), it also satisfies IM as well (Lemma A.7). Since sd-DNNF C d-DNNF, these 
results follow for sd-DNNF. Hence, we obtain the table. 

Table 27: It is known that determining whether the conjunction of two FBDD formulas ai and a2 
is consistent is NP-complete (Gergov & Meinel, 1994b) Moreover, FBDD satisfies -iC. Since ai A a2 
is inconsistent iff ai \= ^q;2, we can reduce the consistency test into an entailment test. Hence, FBDD 
does not satisfy SE. Since FBDD C d-DNNF, d-DNNF does not satisfy SE either. Finally, since every 
d-DNNF can be translated into an equivalent sd-DNNF sentence in polytime (Lemma A.l), sd-DNNF 
does not satisfy SE either. This leads to the final table above. □ 
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Table 27: 

Proof of Proposition 5.1 

The proof of this proposition is broken down into eight steps. Each step corresponds to one of the 
transformations, where we prove all results pertaining to that transformation. 

• CD. To show that a language L satisfies CD, we want to show that for any sentence S e L 
and any consistent term 7, we can construct in polytime a sentence which belongs to L and is 
equivalent to S | 7. 

— NNF, f-NNF, CNF and DNF. The property is trivially satisfied by these languages: If S 
belongs to any of these languages, then replacing the literals of 7 by a Boolean constant 
in S results a sentence in the same language. In the case of DNF (resp. CNF), some 
inconsistent terms (valid clauses) may result through conditioning, but these can be 
removed easily in polynomial time. 

— DNNF. It is sufficient to prove that conditioning preserves decomposability. For every 

propositional sentences a, (3 and every consistent term 7, if a and (3 do not share variables, 
then a|7 and /3|7 do not share variables either since Vars{a\^) C Vars{a) and Vars{l3\^) C 
Vars{l3). 

— d-NNF and d-DNNF. Since NNF and DNNF satisfy CD, it is sufficient to prove that condition- 
ing preserves determinism, i.e. for every propositional formulas a, (3 and every consistent 
term 7, if a A /3 |= false, then (a|7) A (/3|7) |= false. If a A f3 \= false, then for every term 
7, we have (a A /3) A 7 |= false. Since (a A /3) A 7 = ({a A (3)\j) A 7, this implies that 
((a A/3)|7) A7 \= false. Since 7 is consistent and share no variable with (aA/3)|7, it must 
be the case that {a A /3)|7 is inconsistent. This is equivalent to state that (a|7) A {(3\-f) ^ 
false. 

— s-NNF and sd-DNNF. Since NNF satisfies CD, and since conditioning preserves decompos- 
ability and determinism, all we have to show is that conditioning also preserves smooth- 
ness. This follows immediately since for two propositional sentences a, and a consistent 

term 7, we have Vars(a) = Vars{P) only if Vars(a \ 7) = Vars{f3 \ 7). 

— BDD, FBDD, OBDD and OBDD<. It is well-known that BDD satisfies CD- the conditioning 
operation on binary decision diagrams is known as the restrict operation (Bryant, 1986). 
To condition a sentence T, in BDD on a consistent term 7, we replace every node labeled 
by a variable in 7 by one of its two children, according to the sign of the variable in 7. 
The resulting sentence is also a BDD and is equivalent to S | 7. The same applies to FBDD, 
OBDD and OBDD<. 

— PI. The prime implicates of S A 7 can be computed in polytime when S is in prime 
implicates form and 7 is a term (see Proposition 36 in (Marquis, 2000)). Moreover, since 
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PI satisfies FO (see below), the prime implicates of 3 Vars (7). (E A 7) can be computed in 
polytime. But these are exactly the prime implicates of S | 7 according to Lemma A. 12. 

— IP. Let Ti = ViLi 7i be a formula in prime implicants form. It is clear that the formula 
(Vr=i ')'«) I 7 is a DNF formula equivalent to E | 7. Now, our claim is that the formula E* 
obtained by keeping only the logically weakest terms 7^ | 7 among (Vi=i 7«) | 7 is a prime 
implicants formula equivalent to E | 7. Removing such terms clearly is truth-preserving. 
Since generating E* requires only O(n^) entailment tests among terms, and since such 
tests can be easily achieved in polynomial time, we obtain that IP satisfies CD. Now, 
how to prove that E* is in prime implicants form? Since any pair of different terms of 
E* cannot be compared w.r.t. logical entailment, the correctness of Quine's consensus 
algorithm for generating prime implicants shows that it is sufficient to prove that every 
consensus among two terms of E* is inconsistent or entails another term of E*. Let's 
recall that consensus is to DNF formulas what resolution is to CNF formulas. Since E 
is in prime implicants form, every consensus among two terms of E is inconsistent or 
entails another term of E. What happens to the terms (here, the prime implicants) of E 
when conditioned by 7? All those containing the negation of a literal of 7 are removed 
and the remaining ones are shortened by removing from them every literal of 7. Hence, 
for every pair of terms 71, 72 of E, if there is no consensus between 71 and 72, then 
there is no consensus between 71 17 and 72 17: conditioning cannot create new consensus. 
Now, it remains to prove that no unproductive consensus between terms of E can be 
rendered productive through conditioning. Formally, let 71 = 7^ A ? and 72 = 72 A -il be 
two prime implicates of E s.t. I (resp. -■/) does not appear in 7^ (resp. 72). There is a 
consensus "/[ A 72 between 71 and 72. Let us assume that both 71 and 72 have survived the 
conditioning: this means that both 71 17 and 72 17 are consistent. Especially, I belongs to 
71I7 and -il belongs to 72|7- Accordingly, there is a consensus between 71I7 and 72|7- By 
construction, this consensus is equivalent to (7i|7)A(72|7), hence equivalent to (7iA72)|7. 
Now, if j[ A 72 is inconsistent, then (7^ A 72) I7 is inconsistent as well and we are done. 
Otherwise, let us assume that there exists a prime implicant 73 of E s.t. 7i A 72 |= 73 
holds. Necessarily, 73 is preserved by the conditioning of E by 7. Otherwise, 73 would 
contain the negation of a literal of 7, but since every literal of 73 is a literal of 71 or a 
literal of 72, 72 and 73 would not have both survived the conditioning. Since 7i A72 |= 73 
holds, we necessarily have (7^ A 72) I7 \= 73 17- This completes the proof. 

— MODS. Direct consequence of Lemma A. 12 and the fact that MODS satisfies ABC and FO 
(see below). 

• FO. 

- DNNF and DNF. It is known that DNNF satisfies FO (Darwiche, 2001a). It is also known 
that DNF satisfies FO (Lang et al., 2000). 

- NMF, s-MMF, f-MNF, d-NNF, BDD and CNF. Let E be a sentence in CNF. We now show that 

if any of the previous languages satisfies FO, then we can test the consistency of E in 
polytime. Since CNF does not satisfy CO (see Proposition 4.1), it then follows that none 
of the previous languages satisfy FO unless P = NP. First, we note that E must also 
belong to NNF and f-NNF. Moreover, E can be turned into a sentence in BDD in polytime 
(Lemma A. 8) or a sentence in s-NNF in polytime (see the proof of Lemma A.l). We 
also have that E can be turned into a sentence in d-NNF in polytime since BDD C d-NNF. 
Suppose now that one of the previous languages, call it L, satisfy FO. We can test the 
consistency of E in polytime as follows: 

* Convert E into a sentence E* in L in polytime (as shown above). 

* Compute 3 Vars(E*).E*, which can be done in polytime by assumption. 
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* Test the validity of 3 Var.s(S*).S!*, which can be done in polytime since the sentence 
contains no variables — all we have to do is check whether the sentence evaluates to 
true. 

Finally, note that the definition of forgetting implies that a sentence T is consistent iff 
3 Vars(r).r is valid, which completes the proof. 

— d-DNNF and sd-DNNF. Follows immediately since none of these languages satisfies SFO 
unless P = NP (see below). 

— IP. Follows immediately since IP does not satisfy SFO. 

— FBDD, OBDD and OBDD<. We will show that if FBDD (resp. OBDD, OBDD<) satisfies FO, 

then for every sentence F in DNF, there must exist an equivalent sentence E in FBDD (resp. 
OBDD, OBDD<), which size is polynomial in the size of F. This contradicts the fact that 
FBDD (resp. OBDD, OBDD<) ^ DNF— see Table 3. 

Given a DNF F consisting of terms 71, 7„, we can convert each of these terms into equiv- 
alent FBDD (resp. OBDD, OBDD<) sentences ai,...,a„ in polytime. Let {vi, . . . ,Vn-i} 
be a set of variables that do not belong to PS. Construct a new set of variables 
PS' = PS U {vi, . . . ,Vn-i}- In case of OBDD and OBDD<, we also assume that these 
new variables are earlier than variables PS in the ordering. Consider now the sentence 
S = 3{vi, . . . , w„_i}.A^, with respect to variables PS', where A* is inductively defined 
by: 

* A' = ai, for i = n, and 

* A' = (ctj A Vi) V (A*+i A -^Vi), for i = 1, .... n - 1. 

Clearly enough, an FBDD (resp. OBDD, OBDD<) sentence equivalent to A^ can be computed 
in time polynomial in the input size. Moreover, we have E = Vr=i ^» = Vr=i = ^■ 
Hence, if FBDD (resp. OBDD, OBDD<) satisfies FO, then we can convert the DNF sentence F 
into an equivalent FBDD (resp. OBDD, OBDD<) which size is polynomial in the size of the 
given DNF. This is impossible in general. 

— PI. It is known that the prime implicates of 3X.E are exactly the prime implicates of E 
that do not contain any variable from X (see Proposition 55 in (Marquis, 2000)). Hence, 
such prime implicates can be computed in time polynomial in the input size when E is 
in prime implicates form. 

— MODS. Given a MODS formula E and a subset X of PS, the formula obtained by removing 
every leaf node (and the corresponding incoming edges) of E labeled by a literal x or -ix 
s.t. X G X is a, MODS representation of 3X.E — this is an easy consequence of Propositions 
18 and 20 from (Lang et al., 2000). See also the polytime operation of forgetting on DNNF, 
as defined in (Darwiche, 2001a), which applies to MODS, since 

MODS C DNNF, and which can be easily modified so it guarantees that the output is in 
MODS when the input is also in MODS. 

• SFO. 

— DNNF, DNF, PI and MODS. Immediate from the fact that each of these languages satisfies 
FO (see above). 

— NNF, d-NNF, s-NNF, f-NNF, BDD, OBDD< and CNF. Direct from the fact that 3a;.E = (E|x) V 
(Ej-ix) holds and the fact that any of these fragments satisfies CD and VBC. 

— OBDD. Direct from the fact that only one OBDD sentence is considered in the transformation 
and OBDD< satisfies SFO. 

— d-DNNF, sd-DNNF and FBDD. Let ai and a2 be two FBDD formulas. Let x be a variable^ 
not included in Vars{ai) U Vars{a2). The formula E = (a; A ai) V {-ix A is a FBDD 
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formula since decomposability and decision are preserved by this construction. Since 3x.S 
is equivalent to ai V Q2, if FBDD would satisfy SFO, it would satisfy VBC as well, but 
this is not the case unless P = NP (see below). The same conclusion can be drawn for 
d-DNNF. Hence, FBDD and d-DNNF do not satisfy SFO unless P = NP. Since every d-DNNF 
formula can be turned in polynomial time into an equivalent sd-DNNF formula, we obtain 
that sd-DNNF does not satisfy SFO unless P = NP. 

— IP. Let us show that the number of prime implicants of 3a;. S can be exponentially greater 
than the number of prime implicants of S. Let S' be the following DNF formula: 



E' has (to + l)*^ + mk primes implicants (Chandra & Markowsky, 1978). Now, let S be 
the formula: 



Since S' can be obtained from E by removing in every term of E every occurrence of x 
and -ix, E' is equivalent to 3{a;}.E (see (Lang et al., 2000)). Now, E has only mk + 1 
prime implicants; indeed, every term of it is a prime implicant, and the converse holds 
since every term is maximal w.r.t. logical entailment and every consensus of two terms 
is inconsistent. This completes the proof. 



— NNF, s-NNF, d-NNF, CNF. The property is trivially satisfied by these languages since deter- 
minism and smoothness are only concerned with or-nodes. Hence, if ai, . . . , a„ belong to 
one of these languages, so is ai A . . . A «„. 

— ODD. It is well-known that the conjunction of two BDDs a and /3 can be easily computed 

by connecting the 1-sink of a to the root of f3 (see proof of Lemma A. 8). The size of the 
resulting BDD is just the sum of the sizes of the respective BDDs of a and /?. Accordingly, 
we can repeat this operation n times in time polynomial in the input size. 

— f-NNF. Direct from the fact that f-NNF does not satisfy ABC. 

— FBDD, OBDD, OBDD<, DNF, PI and IP. It is straightforward to convert a clause into an 
equivalent formula in any of these languages in polynomial time. In the proof of Proposi- 
tion 3.1, we show specific CNF formulas which cannot be turned into an equivalent FBDD 
(rcsp. OBDD, OBDD<, DNF, PI and IP) formulas in polynomial space (see Tables 9 and 10). 
Hence, such conversion cannot be accomplished in polynomial time either. This implies 
that none of FBDD, OBDD, OBDD<, DNF, PI and IP satisfies AC. 

— DNNF, d-DNNF and sd-DNNF. Direct from the fact that none of these languages satisfy 
ABC unless 



— MODS. Let E = AlLi^*' where E, = {xi^i V Xi^2), i € l..n. Each Ej has 3 models over 
Vars(Ei). Since E has 3" models, it does not have a MODS representation of size polynomial 
in the input size. 





• AC. 



P = 



NP. 



• ABC. 
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— NNF, s-NNF, d-NNF, BDD and CNF. Immediate since each of these languages satisiy AC (see 
above). 

— DNNF, d-DNNF, sd-DNNF, FBDD and OBDD. Checking whether the conjunction of two OBDD< 
formulas ai and (w.r.t. two different variable orderings <) is consistent is NP-complete 
(sec Lemma 8.14 in (Mcincl & Theobald, 1998)). Since OBDD satisfies CO, it cannot satisfy 
ABC unless P = NP. Since OBDD C FBDD C d-DNNF C DNNF, and d-DNNF and DNNF satisfy 
CO, none of them can satisfy ABC unless P = NP. Finally, since every d-DNNF formula 
can be turned in polynomial time into an equivalent smoothed d-DNNF formula and since 
sd-DNNF satisfies CO, it cannot be the case that sd-DNNF satisfy ABC unless P = NP. 

— OBDD<. Well-known fact (Bryant, 1986). 

— f-NNF. Let tti = t\iZn{xn V Xnj^x) be a CNF formula and 0:2 — V"=o^(^2i ^ •^2i+i) ^ 
formula. a\ has 2" essential prime implicants and n essential prime implicates (see the 
proof of Proposition 3.1, Table 9). By duality, a2 has n essential prime implicants and 2" 
essential prime implicates. Now, ol\ and a2 are two f-NNF formulas. By Lemma A. 13, we 
know that every f-NNF formula /? can be turned in polynomial time into a CNF formula or 
a DNF formula. If f-NNF would satisfy ABC, then a f-NNF formula /? s.t. /? = ai Aa2 could 
be computed in time polynomial in the input size. Hence, either a CNF formula equivalent 
to Oil A Q!2 or a DNF formula equivalent to (X\ A a-i could be computed in polytime. But 
this is impossible since a\ A a-i has n + 2" essential prime implicates and n * 2" essential 
prime implicants. Hence every CNF (resp. DNF ) formula equivalent to ol\ A ai has a size 
exponential in |q!i| + |q;2|- 

Note that in the case where the two f-NNF formulas ol\ and 0L2 into consideration can 
be turned in polynomial time into either two CNF formulas or two DNF formulas, then a 
f-NNF formula equivalent to q.\ A a2 can be computed in time polynomial in the input 
size (this is obvious when two CNF formulas are considered and the next item of the proof 
shows how this can be achieved when two DNF formulas are considered). 

— DNF and MODS. If ai and a2 are sentences in one of these languages L, then we can construct 
a sentence in L which is equivalent to ai Aq!2 by simply taking all the conjunctions of one 
term from ai and one term from a2, while removing redundant literals in the resulting 
terms and removing any inconsistent terms in the result. The disjunction of all the 
resulting terms is a sentence from L equivalent to ai A 0:2 and it has been computed in 
polynomial time. 

— PI. Let ai = Vi=i Pi and a2 = Ai=i Kj=i{~^Pi^ lid)- Sentence ai has one prime implicate 
and a2 has m*k prime implicates. But a\ A a2 has {m+ 1)'^ +m* k prime implicates 
(Chandra & Markowsky, 1978). 

— IP. Let IP{a) be the set of prime implicants for a. We have IP{ai A a2) = max({/3i A 
02 \ 01 & IP{ai),P2 G IP{a2)}, \=) (up to logical equivalence). See e.g., (dual of) Propo- 
sition 40 in (Marquis, 2000). 
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• VC. 

— NMF, s-NNF, DNNF and DNF. The property is trivially satisfied by these languages since 
decomposability is only concerned with and-nodcs, and since every NMF formula can be 
turned in polynomial time into an equivalent smoothed NNF formula. 

— d-NNF and BDD. Direct consequence from the fact that d-NNF and BDD satisfies both AC 
and ^C. Especially, it is well-known that the disjunction of two BDDs a and /3 can 
be easily computed by connecting the 0-sink of a to the root of /? (see the proof of 
Lemma A. 8). The size of the resulting BDD is just the sum of the sizes of the respective 
BDDs of a and /?. Accordingly, we can repeat this operation n times in time polynomial 
in the input size. 

— f-NNF. Since f-NNF does not satisfy AC but satisfies -iC, it cannot satisfy VC (due to 
De Morgan's laws). 

— FBDD, OBDD, OBDD<, CNF, PI, IP and MODS. It is straightforward to convert any term into 
an equivalent formula from any of the previous languages in polynomial time. In the 
proof of Proposition 3.1, we show specific DNF formulas which cannot be turned into 
equivalent FBDD (resp. OBDD, OBDD<, CNF , PI, IP and MODS) formulas in polynomial space 
(see Tables 9, 10 and 15). Hence, the conversion cannot be accomplished in polynomial 
time either. This imphes that none of FBDD, OBDD, OBDD<, CNF, PI, IP and MODS satisfies 
VC. 

— d-DNNF and sd-DNNF. Immediate form the fact that none of these classes satisfies VBC 
unless P = NP (see below). 

• VBC. 

— NNF, d-NNF, DNNF, s-NNF, BDD and DNF. Immediate since each of these languages satisfies 
VC. 

— OBDD<. Well-known fact (Bryant, 1986). 

— OBDD, FBDD, d-DNNF and sd-DNNF. Checking whether the conjunction of two OBDD< formu- 
las ai and 0:2 (w.r.t. two different variable ordcrings <) is consistent is NP-complcte (see 
Lemma 8.14 in (Meinel & Theobald, 1998)). Now, ai A 02 is inconsistent iff -lai V -•a2 
is valid. Since OBDD satisfies -iC, an OBDD formula equivalent to -lai (resp. -102) can 
be computed in time polynomial in |ai| (resp. |a2|). Since OBDD C FBDD C d-DNNF, the 
resulting formulas are also FBDD and d-DNNF formulas. If OBDD (resp. FBDD, d-DNNF) 
would satisfy VBC, then an OBDD (resp. FBDD, d-DNNF) formula equivalent to -lai V -■q;2 
could be computed in time polynomial in |q:i| + |q!2|- But since d-DNNF satisfies VA, 
this is impossible unless P = NP. Finally, since every d-DNNF formula can be turned in 
polynomial time into an equivalent sd-DNNF formula, sd-DNNF cannot satisfy VBC unless 
P = NP. 

— f-NNF. Since f-NNF does not satisfy ABC but satisfies -iC, it cannot satisfy VBC (due 
to De Morgan's laws). 

— CNF. If ai and a2 are two CNF sentences, then we can construct a CNF sentence which 
is equivalent to ai V a2 by simply taking all the disjunctions of one clause from ai and 
one clause from a2, while removing redundant literals inside the resulting clauses and 
removing any valid clause in the result. The conjunction of all the resulting clauses is a 
CNF sentence equivalent to ai V a2, and it has been computed in polynomial time. 

— PI. Let PI{a) be the set of prime implicates for sentence a. We have PI{ai V 0:2) = 
min({/3i V/?2 I /3i G P/(ai),/32 G PI{a2)}, \=)- See Proposition 40 in (Marquis, 2000). 
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— IP. Let ai = /\i^iPi and a2 = Vi=i y^i{~'Pi^1i,j)- Sentence ai has one prime implicant 
and a2 has m* k prime impHcants. But ai V a2 has {m+ 1)'' + m* k prime implicants 
(Chandra & Markowsky, 1978). 

— MODS. Let ai = Ar=i ^^"^ = 2/- Sentence ai has 1 model over Vars{ai) and a2 has 
1 model over Vars{a2)- But ai V a2 has 2" + 1 models over Vars{ai) U Vars(a2). 



— NMF, s-NNF, f-MNF, BDD, FBDD, OBDD and OBDD<. The property is obviously satisfied by 
NNF. s-NNF also satisfies since every NNF formula can be turned in polynomial time 
into an equivalent s-NNF formula. f-NNF satisfies since applying De Morgan's laws 
on a f-NNF formula results in a f-NNF formula. Finally, for all the forms of BDDs, it is 
sufficient to switch the labels of the sinks to achieve negation (Bryant, 1986). 

— CNF. Because the negation of a DNF formula is a CNF formula that can be computed in 
polynomial time, if CNF would satisfy ^C, then it would bo possible to turn any DNF 
formula into an equivalent CNF formula in polynomial time (by involution of negation). 
But we know that it is not possible in polynomial space since CNF ^ DNF(see the proof of 
Proposition 3.1). Hence, CNF does not satisfy -iC. 

— DNF. Dual of the proof just above (just replace CNF by DNF and vice-versa). 

— PI. The formula S„ = Ar=o^(^2i V X2i+i) is in prime implicates form (see the proof of 
Proposition 3.1, Tabic 9). This formula has exponentially many prime implicants, that 
are just the negations of the prime implicates of -iSn- Since -iSn has exponentially many 
prime implicates, it cannot be the case that PI satisfies -iC. 

— IP. We just have to take the dual of the above proof (prime implicates case). The formula 

I]„ = Vr=o^(^2iAa;2i+i) is in prime implicants form. This formula has exponentially many 
prime implicates, that are just the negations of the prime implicants of -iS„. Since 
has exponentially many prime implicants, it cannot be the case that IP satisfies -iC. 

— DNNF. The negation of any CNF formula can be computed in polynomial time as a DNF 
formula, hence as a DNNF formula. If DNNF would satisfy -iC, then it would be possible 
to turn a CNF formula into an equivalent DNNF one (by involution of negation). Because 
DNNF satisfies CO, we would have P = NR. 

— d-NNF. Following is a procedure for negating a d-NNF sentence A:^ 

* Traverse nodes in the DAG of A, visiting the children of a node before you visit the 
node itself. When visiting a node, construct its negation as follows: 

• true is the negation of false. 

■ false is the negation of true. 

■ A{N[ , . . . , N'l,) is the negation of\/{Ni, . . . , N/-). Here, N- is the node representing 
the negation of Ni. 

■ V{a{N[,Mi), A(iV^, Mfe)) is the negation of A{Ni, Nk). Here, N( is the 
node representing the negation of Ni, and Mj is a node representing the conjunc- 
tion A^i A ... A A^i_i. 

* Return the negation of the root of d-NNF A. 

We can implement the above four steps so that we when we visit a node with k children, 
we only construct 0{k) nodes and 0{k) edges. ^'^ Hence, the procedure complexity is 

9. Mark Hopkins pointed us to this procedure. 
10. Wc assume that any or-node (resp. and-node) with less than two children is removed and replaced by its unique 
child or by false (resp. true) if it has no children. This simplification process is equivalence-preserving and it 
can be achieved in time linear in the size of the input DAG. 
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linear in the size of the original d-NNF. It is easy to check that the result is equivalent to 
the negation of the given d-NNF sentence and is also in d-NNF. 

- sd-DNNF and d-DNNF. Unknown. 

— MODS. E = Ar=i •''i model over ljr=i negation -iS has 2" — 1 
models over IJlLii^*}- Hence MODS cannot satisfy -iC. □ 

References 

A. Cimmati, E. Giunchiglia, F. G., & Traverse, P. (1997). Planning via model checking: a decision 
procedure for ATZ. In Proceedings of the 4*'* European Conference on Planning (ECP'97), pp. 
130-142. 

Blum, M., Chandra, A. K., & Wegman, M. N. (1980). Equivalence of free Boolean graphs can be 
decided probabilistically in polynomial time. Information Processing Letters, 10(2), 80-82. 

Boole, G. (1854). An investigation of the laws of thought. Walton and Mabcrlcy, London. 

Boufkhad, Y., Grcgoire, E., Marquis, P., Mazure, B., & Sais, L. (1997). Tractable cover compilations. 
In Proc. of the 15*'' International Joint Conference on Artificial Intelligence (IJCAI'Q?), pp. 
122 127, Nagoya. 

Bryant, R. E. (1986). Graph-based algorithms for Boolean function manipulation. IEEE Transac- 
tions on Computers, C-35, 677-691. 

Bryant, R. E. (1992). Symbolic Boolean manipulation with ordered binary decision diagrams. ACM 

Computing Surveys, 24{3), 293-318. 

Cadoli, M., & Donini, F. (1997). A survey on knowledge compilation. AI Communications, 10, 
137-150. (printed in 1998). 

Cadoli, M., Donini, F., Liberatore, P., & Schaerf, M. (1996). Comparing space efficiency of prepo- 
sitional knowledge representation formalisms. In Proc. of the 5'"'^ International Conference on 

Knowledge Representation and, Reasoning (KR'96), pp. 364-373. 

Chandra, A., & Markowsky, G. (1978). On the number of prime implicants. Discrete Mathematics, 
24, 7-11. 

Darwiche, A. (1999). Compiling knowledge into decomposable negation normal form. In Proceedings 
of International Joint Conference on Artificial Intelligence (IJCAI'QO), pp. 284-289. Morgan 
Kaufmann, California. 

Darwiche, A. (2001a). Decomposable negation normal form. Journal of the ACM, 48{^), 608-647. 

Darwiche, A. (2001b). On the tractability of counting theory models and its application to belief 
revision and truth maintenance. Journal of Applied Non-Classical Logics, ii (1-2), 11 34. 

Darwiche, A., & Huang, J. (2002). Testing equivalence probabilistically. Tech. rep. D-123, Computer 
Science Department, UCLA, Los Angeles, Ca 90095. 

de Kleer, J. (1992). An improved incremental algorithm for generating prime implicates. In Proc. 
of the 10*^ National Conference on Artificial Intelligence (AAAr92), pp. 780-785, San Jose, 
California. 

Dechter, R., & Rish, I. (1994). Directional resolution: the Davis-Putnam procedure, revisited. In 
Proceedings of the Fourth International Conference on Principles of Knowledge Representation 
and Reasoning (KR'94), pp. 134 145, Bonn. 

del Val, A. (1994). Tractable databases: How to make propositional unit resolution complete through 
compilation. In Proceedings of the International Conference on Principles of Knowledge Rep- 
resentation and Reasoning (KR'94), PP- 551-561. Morgan Kaufmann Publishers, Inc., San 
Mateo, California. 



263 



Darwiche & Marquis 



Gergov, J., & Mcincl, C. (1994a). EfRcicnt analysis and manipulation of obdds can be extended to 
fbdds. IEEE Transactions on Computers, 43(10), 1197-1209. 

Gergov, J., & Meinel, C. (1994b). On the complexity of analysis and manipulation of Boolean 
functions in terms of decision diagrams. Information Processing Letters, 50, 317-322. 

Gogic, G., Kautz, H., Papadimitriou, C, & Sclman, B. (1995). The comparative linguistics of 
knowledge representation. In Proc. of the 14*'' International Joint Conference on Artificial 
Intelligence (IJCAroS), pp. 862-869, Montreal. 

Herzig, A., & Rifi, O. (1999). Propositional belief base update and minimal change. Artificial 
Intelligence, 115{l), 107-138. 

Karp, R., & Lipton, R. (1980). Some connections between non-imiform and uniform complexity 
classes. In Proc. of the 12*'' ACM Symposium on Theory of Computing (STOC'80), pp. 302- 
309. 

Khardon, R., & Roth, D. (1997). Learning to reason. Journal of the ACM, 44{5), 697-725. 

Lang, J., Liberatorc, P., & Marquis, P. (2000). Propositional independence — Part I: formula-variable 

independence and forgetting. Submitted. 

Madre, J. C., & Coudert, O. (1992). A new method to compute prime and essential prime implicants 
of boolean functions. In Advanced research in VLSI and parallel systems, Proceedings of the 
Brown/MIT conference, pp. 113-128. 

Marquis, P. (2000). Consequence finding algorithms. Vol. 5 of Handbook of Defeasible Reasoning 
and Uncertainty Management Systems: Algorithms for Uncertain and Defeasible Reasoning. 
Kluwer Academic Publishers. 

Marquis, P. (1995). Knowledge compilation using theory prime implicates. In Proc. International 
Joint Conference on Artificial Intelligence (IJCAFOB), pp. 837-843. Morgan Kaufmann Pub- 
lishers, Inc., San Mateo, California. 

Meinel, C., & Theobald, T. (1998). Algorithms and Data Structures in VLSI Design: OBDD Foun- 
dations and Applications. Springer. 

Papadimitriou, C. (1994). Computational complexity. Addison-Weslcy. 

Quine, W. (1955). A way to simplify truth functions. American Mathematical Monthly, 52, 627-631. 

Quine, W. (1959). On cores and prime implicants of truth functions. American Mathematical 
Monthly, 66, 755-760. 

Reiter, R., & de Kleer, J. (1987). Foundations of assumption-based truth maintenance systems: 
Preliminary report. In Proceedings of the Fifth National Conference on Artificial Intelligence 
(AAAI), pp. 183-188. 

Roth, D. (1996). On the hardness of approximate reasoning. Artificial Intelligence, 82{l-2), 273-302. 

Schrag, R. (1996). Compilation for critically constrained knowledge bases. In Proc. of the 13*'' 

National Conference on Artificial Intelligence (AAAF96), pp. 510-515, Portland, Oregan. 

Selman, B., & Kautz, H. (1996). Knowledge compilation and theory approximation. Journal of the 
Association for Computing Machinery, 43, 193-224. 

Simon, L., & del Val, A. (2001). Efficient consequence finding. In Proc. of the 17*'' International 
Joint Conference on Artificial Intelligence (IJCAFOl), pp. 359-365, Seattle (WA). 

Wegener, I. (1987). The complexity of boolean functions. Wiley- Teubner, Stuttgart. 



264 



